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Abstract 


In this paper, we apply set partitioning to multi-dimensional signal spaces over GF(q ), 
particularly GF q ~ l {q) and GF q (q), and show how to construct both multi-level block codes 
and multi-level trellis codes over GF(q). We present two classes of multi-level (n, k,d) block 
codes over GF(q) with block length n, number of information symbols k, and minimum 
distance dmin > d, where n = nin 2 ,k — n — 1 min { — 1, ^2}^ n\ = q — 1 ot q, 

n 2 = q — l, q, or q + 1, and |V| is the smallest integer larger than or equal to x. These 
two classes of codes use Reed-Solomon codes as component codes. They can be easily 
decoded as block length q — 1 Reed-Solomon codes or block length q or q + 1 extended 
Reed-Solomon codes using multi-stage decoding. Many of these codes have larger distances 
than comparable <7- ary BCH codes. Longer block codes can be constructed by using q- ary 
BCH codes, or other <7- ary block codes, as component codes. Low rate <7- ary convolutional 
codes, word error-correcting convolutional codes, and binary-to-g-ary convolutional codes 
can also be used to construct multi-level trellis codes over GF(q) or binary-to-g-ary trellis 
codes, some of which have better performance than the above block codes. All of the new 
codes have simple decoding algorithms based on hard decision multi-stage decoding. 
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1 Introduction 


In this paper, we combine multi-level coding with set partitioning of multi- dimensional signal 
spaces to construct several new classes of block and trellis codes over GF(q). Many of these 
codes have a better trade-off of minimum distance, information rate, and decoding complexity 
than previously known q- ary codes. A simple, fast decoding algorithm based on hard decision 
multi-stage decoding is also presented. 

The technique of multi-level coding has been introduced in several recent papers [ 1 - 6 , 
10 , 11]. Most researchers have considered the case where the signals are points in an N- 
dimensionalEuclidean space and the codes are designed to maximize the minimum Euclidean 
distance. Binary block or convolutional codes which maximize the minimum Hamming 
distance have been used as component codes to construct multi-level codes based upon 
binary (two-way) set partition chains. However, little work has been done on multi-level 
codes based upon q- way ( q > 2) set partition chains, which require the use of 9-ary codes 
as component codes. In the following sections, we will apply set partitioning to multi- 
dimensional signal spaces over GF(q ), particularly GF q ~ l (q) and GF q (q), and show how to 
construct both multi-level block codes and multi-level trellis codes over GF(q). These new 
codes use 9-ary block and convolutional codes as component codes. 

In Section 2 , we construct two 9- way set partition chains for GF q ~ 1 (q) and GF q (q) by 
using Reed-Solomon codes and shortened extended Reed-Solomon codes, respectively. Based 
on these set partition chains, in Section 3 we construct two classes of multi-level ( n,k,d ) 
block codes over GF(q) with n = nin 2 and k = n — min { — li n 2}i where 

ni = q — 1 or 9, n 2 = 9 — 1, 9, or 9 4- 1, and [af| is the smallest integer larger than or 
equal to x. (Throughout the paper, an ( n,k,d ) block code means that the code has block 
length n, number of information symbols k , and design distance d , which may be less than 
the minimum distance d m ; n of the code.) These two classes of codes use Reed-Solomon 
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codes as component codes and have the following advantages over < 7 -ary BCH codes and 
Reed-Solomon codes: 

1 . Block lengths of order q 2 can be achieved, as opposed to block lengths of order q using 
Reed-Solomon codes. 

2. For the same Hamming distance, many of these codes have higher information rates 
than q - ary feCH codes. 

3. Since these codes have a multi-level structure, they can be simply decoded using hard 
decision multi-stage decoding of the component Reed-Solomon codes. 

In Section 4, we use < 7 -ary convolutional codes as component codes to obtain a class of 
( 7 - ary trellis codes with higher information rates than the two above classes of block codes. 
In Section 5 , we present another class of codes, binary-to-g-ary trellis codes, which provide 
more trade-offs between information rate and decoding complexity for the same minimum 
distance. 

Although this study of multi-level codes over GF(q) is motivated by the problem of 
finding multi-level codes based on higher-way set partition chains for QAM and PSK signal 
constellations, the new codes are interesting in their own right and can be used to correct 
both random errors and burst errors if the channel symbols are elements in a subfield of 

GF(q). 

2 The Set Partition Chain of GF q ~ l {q ) and GF q (q ) 

The new codes use a multi-level construction based on set partition chains of the multi- 
dimensional signal spaces GF q ~ l (q) and GF q (q). The purpose of this section is to construct 
these two set partition chains. In the following, we use both polynomials and vectors to 


3 



represent codewords, i.e., the polynomial representation of the codeword (c 0 , Cj, . . . , c n _i) is 
c 0 + c x x H 1- c n _ix n_1 . 

For simplicity, and without loss of generality, suppose the generator polynomial of the 
{ n \i n i —*,*’ + 1 ) Reed-Solomon code over GF(q ), donated by RS(i), is 

9i(x) = (x - l)(x - 0) ... (x - d'" 1 ), i = 1, 2, . . . , q - 1, (1) 


where nj is equal to <7 — 1 and d is a primitive element of GF(q). In particular, let RS( 0 ) = 
GF g ~ 1 (q). Also, let the minimum distance of a single point in the set GF g ~ l (q) (a single 
codeword in PS^O)) be 00. Next define 


pt \ A (x- l)(x-d)...(x-d’ ]) 
A ’ {d i - l)(d' - d )(d'' - 3 - 1 ) : 


i = 1,2 ,...,?- 1, 


( 2 ) 


and 


P 0 (x) = 1. 


( 3 ) 


Lemma 1 Fori = 0, 1, . . . ,q — l, if C%{x) is a code polynomial in RS(i), i.e., C,(x) E RS(i), 
then C,(x) - P x {x)Ci{d') E RS(i + 1). 

This can be easily proved by showing that are roots of Ci(x ) — Pi(x)Ci(d t ), 

i.e., ^j +1 (x)|(7j(x) — P t (x)Ci(d'). The next lemma follows directly from Lemma 1. 

Lemma 2 For any i = 0, l,...,qr — 1, and for any arbitrary Ci(x ) E RS(i), Cfx) can be 
uniquely expressed as 

C»'(x) = Pi(x)Ci(d') + C t+ i(x), (4) 

where C{+ i(x) E RS(i + 1). In other words, P,(x)y, y E GF(q), generates q coset represen- 
tatives of RS(i + 1) in RS(i). 

From the above two lemmas, we have 


4 



Theorem 1 GF^~ l \q) = RS(0)/RS(l)/ . . . /RS(q - 2)/RS(q - 1) = {0} is a set partition 
chain with Hamming distances 1/2/ ... /q — l/oo. 

Multi-level codes or other coset codes based on higher dimensional signal sets can some- 
times achieve larger coding gains than codes based on lower dimensional signal sets. This 
motivates us to also construct a set partition chain for GF q (q ), corresponding to shortened 
extended Reed-Solomon codes. The extended Reed-Solomon codes have two more informa- 
tion symbols than the Reed-Solomon codes while maintaining the same minimum distance 
and number of redundant symbols [12]. But we cannot construct a set partition chain for 
GF q + l (q). To obtain a set partition chain for GF q (q ), we use shortened extended Reed- 
Solomon codes, which are obtained by dropping the last symbol of extended Reed-Solomon 
codes. These codes can be defined as follows. 

Definition 1 Let d be an arbitrary integer. A shortened extended Reed-Solomon code is a 
linear code over GF(q) of block length n = q whose codewords (c_,co, ci,. . . , c g _ 2 ) have the 
following properties: 

1. (co,Ci, . . . , c q - 2 ) is a codeword of a (q — 1, q — d + 1, d — 1) Reed-Solomon code with 
generator polynomial 

G(x) = (x- \)(x - d 1 ) ■ ■ ■ {x - d d ~ 3 ) (d > 3), (5) 

where d is a primitive element of GF(q ) and G(x) = 1 for d = 2; 

2 . 

c_ = Co + C\d 1 + c-}d 2 + . . . + c 9 _ 2 d ^ 2 ^. (6) 

Again without loss of generality, let RS'( 0) = GF q (q) and RS'(i) be a shortened extended 
Reed-Solomon code with d = i + 1, for i = 1, 2, . . . , q — 1. The next lemmas are similar to 
Lemmas 1 and 2 and follow directly from Blahut [12]. 
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Lemma 3 The minimum Hamming distance of RS*(i) is i+ 1, for i = 0, 1, 2, . . . , q — 1, and 
RS'{i ) D RS'{i+ = 

Lemma 4 7. For any arbitrary codeword Cq = (cl, Cq, - . . , c'_ 2 ) 6 can 6e 

uniquely expressed as 


q-2 


C' = (ci-^c'd-\0A...,0) + C[, 

j=o 


(7) 


ta/zere E RS'(l) and d~ J = d q ~ 1 ~ j . That is, (y, 0,0, --,0), y e ( 7 F(< 7 ), yeneraies <7 
cosei reyreseniaFres o/CJ m (7 q. 

2. For any arbitrary codeword C[ E F5'(z), i = 1,2 , ...,<7 — 1, C' can be uniquely 
expressed as 


r i - 1 


C,' = + c' +1 


( 8 ) 


where C' +1 e RS'(i + 1), Ci-i 6 RS(i - 1), CJ = ( c.,Ci ), y = O t _ 1 (<9‘- 1 ), and the 
= 0,1,...,* — 1) are i/*e coefficients of 

Pi-i(x) = Po ] x + ... + pfiaX 1-1 . (9) 

That is, (Ej^oPj’^^P^Po V.Pi^P, • • • , Pi-iP, 0, 0, • • ■ , 0), y e GF(q), generates q coset 
representatives of C t - +1 in (7,-, i = 1,2, • • • , q — 1. 

These two lemmas lead to 

Theorem 2 GF g (q) = RS'(0)/RS'(1)/ . . . /RS'(q — l)/RS'(q) = {0} is a set partition chain 
with Hamming distances 1/2/ . . . /q/oo- 

3 Constructions of Block Codes Over GF(q ) 


The general structure of multi-level codes has been described in many references [1-6, 10, 
11 ]. Here we briefly discuss the principle of encoding for multi-level codes based on a q- way 
set partition chain. 
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Suppose H 0 is a signal set in a multi-dimensional space over GF(q) and it generates a 
group under some operation, for example, addition in GF(q). For i = 1,2,.. . ,m, Hi is a 
subgroup of 1 , and H m contains a single element of the space. The coset representative 
of Hi - 1 in Hi is denoted by [Hi-\/ H/\, and the number of cosets is = q, for 

i = 1,2, ... ,m. From the theory of basic algebra, Ho can be expressed as 

m 

Ho = £[#-,/#]. (10) 

1 = 1 

Thus we have a partition chain Ho/ Hi/ . . . / H m with distances A 0 /Ai / . . . /A m _i/oo, where 
A i is the minimum subset distance of Hi under the distance metric in Ho, i.e., Hamming 
distance. 

Figure 1 shows the structure of an encoder for a multi-level code based on the set par- 
tition chain H 0 /H i / . . . / H m = {0}, where F, is the encoder corresponding to code C, with 
information rate Ri and minimum free Hamming distance d,-, i = 0, 1, . . . , m — 1. The cod- 
ing procedure is as follows: First, the information sequence is partitioned into m component 
information sequences having rates R 0 , R \, . . . , R m - 1 , (0 < Ri < 1, i — 0, 1, . . . , m — 1). 
The i th component information sequence enters encoder F,-, for i = 0, 1, . . . , m — 1. In prin- 
ciple, code Ci may be any kind of code with output symbols over GF(q). Each output 
symbol of F t - selects a coset of In this section, we only discuss the case where every 

component code is a block code. In the following two sections, we will show how to improve 
the information rate by using convolutional codes as component codes. 

Suppose Ci is a block code with block length n 2 , for i = 0,1,..., m — 1. Let /,(x) 
be an information polynomial of encoder F, and Yi(x) be a code polynomial in C,, i = 
0, 1, . . . , m — 1. For i = 0, 1, . . . , m — 1, Yi(x) can be expressed as 

= J/o } + y{ ,) * + ■ • • + yn]-ix m ~ l ( 11 ) 

For fixed j, each specifies a coset of F,+i in Hi, for i = 0, 1, . . . , m — 1. By (10), an unique 
point in H 0 , denoted by Sj, j = 0, 1, . . . , n 2 - 1, is specified by yf* (i = 0, 1, . . . ,m — 1). So 
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the multi-level block code is the set of output signals 

s = {(So, Si , . . . , S nj -i) : Y(x) eCi, i = 0, 1, . . . , m - 1} , (12) 

where Si £ H 0 , i = 0, 1, . . . ,n 2 — 1. From references [2-5, 11], a lower bound exists on the 
minimum distance of the code: 

dmin > d. = min {A id,-, 0 < i < m — 1} . (13) 

Therefore, to construct a code, we must choose a suitable set partition chain and a set 
of component codes. The two classes of multi- dimensional signals introduced in Section 
2 provide good choices for set partition chains, and Reed-Solomon codes, extended Reed- 
Solomon codes, and shortened extended Reed-Solomon codes can be used as component 
codes. Now we discuss these codes in detail. 

Construction A. This class of codes is based on the set partition chain RS(0)/RS(1)/ . . . / RS 
(q — 2)/{0). To construct a code with design distance d (d < q — 1), we use component codes 
Ci = — 1), for i = 0, 1, . . . ,d — 2, and C, = RS’(O), for d — 1 < i < q — 2, where [Y| 

is the smallest integer larger than or equal to x. According to (13), this code has minimum 
distance > d> d. A codeword in this code can be expressed as follows: 

Y t (x) = /«(x)^ r?; i_ 1 _ 1 (x) 

= J/0 } + + • •• + 4-2 a;,-2 > for * = 0,1,... ,d -2, (14) 

and Yi(x) = U(x) = + y[ l) x -(- (- yl'l 2 x q ~ 2 , for i = d - 1, . . . , q - 2, (15) 

where y^ £ GF(q), for i = 0, 1, . . . , q — 2, j = 0, 1, . . . , q — 2. It follows from Lemma 2 that 
the j th (q — l)-tuple in a multi-level codeword can be written as 

(16) 

t=0 

Thus, a (q — l) 2 -tuple codeword can be expressed as 

7-2 

s(x ) = y; x^ q ~^s^(x) 

j=0 
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( 17 ) 


j — 0 »=0 


In this case, n = (<7 — l) 2 , k = n — X2f=i [fl — {d — 1), and the minimum Hamming distance 


dmin > d. 


To construct codes with design distance d(q— l<d<(q— l) 2 ), let C, = {0}, i.e., 
ki = 0, for i < (" ~y! ~ 1> a nd C, = RS (Tt+iI — l)> for * > f — 1. In this case, Y, (x) = 0 
for i < ~ an d ^(x) can t> e written as in (14) for i > [^yl — 1. The number of 

information symbols in a codeword is 

+r1 _i)l = 9 ( 9 _r^- 


q- 2 

*- E 


(’-D-fr-rri- 1 ) =»(«-rrrri)- E (is) 

f-fAi-1 L + J q i-rii-1 


Other block codes over GF(q ), such as extended Reed-Solomon codes, shortened extended 
Reed-Solomon codes, and BCH codes over GF(q ) can also be used as component codes. In 
general, if a code polynomial in C{ is 


Ti(x) = y { 0 ' ] + y^x + • • • + yi^x” 2 \ for i = 0, 1 , ... ,q - 2 , 


(19) 


then a codeword in the multi-level code can be expressed as 

712 - 19-2 

S(x) = J2 ^x j(,_1) F i (a:)yj ,) . 

j - 0 t =0 


( 20 ) 


If we take only Reed-Solomon codes, shortened extended Reed-Solomon codes, and extended 
Reed-Solomon codes as component codes, we obtain block codes over GF(q ) having the 
following parameters: 


block length n = (q — l) 2 , q(q — 1), or q 2 — 1, 


number of information symbols 


* = "-Er-l-(<i-l), (d <*/(<!- 1)) 

«= 1 * 
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or 


Jt=(-2 T + l)( 9 -r^^ll)- e’ r-l. (d>n/(q- 1 )) 
9- 1 V " / i=f £^ii 1 * 


and minimum Hamming distance d^n > d (d < n). 


Construction B. This class of codes is based on the partition chain RS r (0)/ RS'(1)/ . . . / RS' 
(q — l)/{0}. Similar to Construction A, a code polynomial in component code C{ can be 
expressed as 

Y,(x) = + y[ i] x + ■ ■ ■ + for i = 0, 1, . . . , q - 1. (21) 


From Lemma 4, the j th q - tuple in a multi-level codeword can be written as 

s«w 4 »f + g g rfWf + x E 


(22) 


«=1 *=1 »=1 
Thus a codeword in the multi-level code can be expressed as 

Using Reed-Solomon codes, shortened extended Reed-Solomon codes, and extended Reed- 
Solomon codes as component codes, we can obtain block codes over GF(q ) having the fol- 
lowing parameters: 



712—1 

s(x) = E 

J =0 


X 


jq 


<?- 1 t-i 


7-1 


yf ] + E E pfyfd ~ k + * £ 1 


t=l k=o 


t=l 


block length n = q(q — 1 ),q 2 , or q(q + 1) 


number of information symbols k = n 



(d < n/q) 


or 

* = (7+o(«-r$i + i)- e 171. («'>"/?) 

" n ' 

and minimum Hamming distance d „ ^ > d (d < n). 

In summary, we have constructed two classes of block codes over GF(q ) having the 

following parameters: 
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a) block length n = niri 2 , where n\ = q — 1 for construction A and q for construction 
B, and n 2 — q — l,q, or q + 1, which is the block length of Reed-Solomon codes, shortened 
extended Reed-Solomon codes, and extended Reed-Solomon codes, respectively. 

b) number of information symbols k = n — 1 min f 7+rl — 1 , n 2 j- 

c) minimum Hamming distance d^n > d. 

Table 1 shows all the codes in the above two classes over GF( 4) with d mm > 3. Table 2 
shows some construction B codes over GF(S) with block length 72 and minimum distance 
from 3 to 15, where the redundancy of the component codes is pi = min { — l,9j ,i = 
0, 1, . . . , 7, and the total redundancy is p = YlJ=o P- 

Example 1. The (20, 9, 8) construction B code shown in Table 1 is based on the set partition 
chain GF\ 4) = RS'(0)/RS , {1)/RS'{2)/RS'{3)/RS , (4) = {0} with distances l/2/3/4/oo. 
It contains four extended Reed-Solomon codes as component codes: Co = (5,0,oo),Ci = 
(5, 2,4),C 2 = (5,3,3), and C 3 = (5,4,2). It has a higher information rate (» vs - y|) and a 
larger minimum distance (8 vs. 7) than the (15, 6, 7) BCH code over GF( 4). 

Table 3 presents a comparison between BCH codes and the new codes over GF( 4). 
Although some codes shown in Table 1 are not as good as BCH codes, there exist many new 
codes better than BCH codes. Moreover, the decoding complexity of these new codes is less 
them the BCH codes. 


Table 1. Codes over GF{ 4) (d > 3) 


(9,6,3)' 

(9,4,4)' 

(9,3,6)' 

(9,1,9)' 

(15,12,3)' 

(15,10,4)' 

(15,8,5)' 

(15,7,6)' 

(15,5,8)' 

(15,4,9)' 

(15,3,10)' 

(15,2,12)' 

(15,1,15)' 

(12, 9, 3) 2 

(12, 7, 4) 2 

(12, 5, 6) 2 

(12, 3, 8) 2 

(12, 2, 9) 2 

(12, 1, 12) 2 

(16, 13, 3) 3 

(16, 11, 4) 3 

(16, 8, 6) 3 

(16, 6, 8) 3 

(16, 4, 9) 3 

(16,3, 12) 3 

(16,1, 16) 3 

(20, 17, 3) 3 

(20, 15, 4) 3 

(20, 12, 5) 3 

(20,11, 6) 3 

(20, 9, 8) 3 

(20, 7, 9) 3 

(20, 6, 10) 3 
(20, 1,20) 3 

1 Construction A; 2 Construction A or 

(20,5, 12) 3 (20,3, 15) 3 

B; 3 Construction B 

(20,2, 16) 3 
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Table 2. (72, 72 - p, d ) Codes over GF( 8) (3 < d < 15) 

d 3 4 5 6 7 8 9 10 11 12 13 14 15 

p 0 2345678999999 
Pl 112233445566 6 

p 2 0111222333444 
^ 3 0011 1 1 222233 3 

p 4 0001 1 1 1 1 2222 2 

/ 3 5 0000 1 1 1 1 1 1222 

p 6 0000011111112 
p 7 0000001 1 11 111 

p 3 5 8 10 14 16 20 22 24 24 28 28 29 


Table 3. Comparison between BCH codes and the new codes over GF( 4) 

BCH [12] (15,11,3) (15,9,5) (15,6,7) (15,4,9) (15,3,11) 

New (20, 17, 3) (20, 12, 5) (20, 9, 8) (20, 7, 9) (20, 5, 12) 

Note that sometimes the actual lower bound distance d is larger than the design distance 
d. For example, in Table 2, both d = 11 and d = 12 lead to the same code (^ = 12). The 
following theorem gives the relationship between d and d- 

Theorem 3 For the above two classes of codes, 

( 1) ford<n \ , d-d. 

(2) for d > m, if there exists an i £ I = {[^-], + 1 so that i divides d, 

then d = d, and if for all i £ I, i does not divide d, then d> d. 

Proof: (1) This follows from the fact that [|1 = ^ an< ^ f rom (13). 

(2) Suppose io £ I divides d. Then d = r^l*o = d. If for all i 6 I, i does not divide d, 
then > d+ 1 for all i G /. By (13), d > d + 1. QED 

Corollary 1. For d > n x , if d is a prime, then the above ( n,k,d ) codes have a lower bound 
distance d>d+ 1 . 
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4 Constructions of Trellis Codes Over GF(q ) 


The idea of applying set partitioning to trellis coding over finite fields was discussed in 
references [13, 14]. Convolutional codes have the advantage of achieving large free distances 
by increasing complexity without decreasing information rate. For this reason, with q - ary 
convolutional codes, we can construct q - ary multi-level trellis codes with the same minimum 
distance and higher information rates than the above block codes. 

High rate convolutional codes over GF(q)(q > 2) are difficult to find, and their decoding 
complexities are high because they are powers of q. Most of the work on constructing 
convolutional codes over GF(q) has focused on low rate codes. Recently, Ryan and Wilson 
[15] have constructed some optimal low rate convolutional codes over GF(q). In this section, 
we will show how to construct high rate multi-level trellis codes over GF(q) by using good 
known low rate convolutional codes with reasonable decoding complexities as component 
codes. We should point out that, from the multi-level coding point of view, the finite state 
codes found in [13] can be viewed as one level codes based on set partitioning of multi- 
dimensional signal spaces over GF(q). 

To construct high rate multi-level trellis codes, we use low rate q - ary convolutional codes 
to replace some of the low rate q - ary block codes as component codes. It is not necessary to 
replace every block component code with a convolutional code. The idea of using a mixture 
of block and convolutional codes at different levels has appeared in [4, 6, 11]. 

Let Ri be the rate of component code C, ( i = 0,1,..., m — 1). Then the rate of the 
overall code is 

< 24 ) 

m i=o 

Since some component codes are convolutional codes, we will use the free Hamming distance 
(df T ee) instead of the minimum Hamming distance (d^n) over a block. The free Hamming 
distance of a trellis code is defined to be the minimum Hamming distance between all pairs 
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of distinct code sequences. In this case, we have a lower bound on the free distance: 

d free > d = min {A,d/(i), 0 < i < m — 1} (25) 

where dj(i) is the free (or minimum) distance of the convolutional (or block) code C;, for 
i = 0, 1, . . . , m — 1. 

Example 2. We will take the (20, 9, 8) code from Table 1 and show how to improve its 
information rate by using convolutional codes. As shown in example 1, the component codes 
are C Q = (5,0, oo), C\ = (5,2,4), C 2 = (5,3,3), and C 3 = (5,4,2). Now replace C 0 by a 
rate 1/2 4-ary convolutional code with free distance 8 and 64 states and C\ by a rate 1/2 
4-ary convolutional code with free distance 6 and 16 states. Thus the trellis code has rate 
R — ~ -f 2 "I" | “I 5 ) = f and free distance 8, whereas the (20, 9, 8) block code also has 
minimum distance 8, but its rate is only 9/20. 

From this example, we can give the following principles for constructing multi-level trellis 
codes with design distance d from the above block codes. If a block component code C, has 
rate less than 1 /2, we consider a rate 1 / 2 convolutional code with free distance larger than or 
equal to d/A,- instead of the block component code; otherwise, retain the block component 
code. If the constraint length of the rate 1/2 convolutional code is too large, implying that 
the decoding complexity is too great, and if the rate of the block component code C, is less 
than 1/3, we can consider a rate 1/3 convolutional code with free distance larger than or 
equal to d/Aj as a candidate to replace the block component code. Other replacement codes 
can be found in a similar way. 

Table 4 lists some codes obtained from the block length 20 codes listed in Table 1, where 
Ri is the rate of component code C,, K{ is the constraint length of convolutional code (7,, and 
dj(i) is the free (or minimum) distance of convolutional (or block) code Ci, for i = 0, 1,2,3. 
The memory order of convolutional code C, is = K% — 1, i.e., the number of states of code 
Ci is 4 a,_1 . To compare with block codes, we also use the notation (n, &, d) for the multi- 
level trellis codes constructed, where n is the block length of the block component codes and 
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k = nR is not necessarily an integer. Compared to the construction B block codes listed in 
Table 1, the codes in Table 4 exhibit a clear improvement in information rate. For example, 
the block code of block length 20 and minimum Hamming distance 12 has information rate 
1/4, whereas the trellis code with free Hamming distance 12 achieves an information rate of 
29/60, or close to 1/2. 


Table 4. Trellis codes over GF( 4) (6 < d < 12) 


Trellis code 

Ri 

I<i 

<*/(•') 


1/2-C 

3 

6 


3/5-B 

- 

3 

(20, 13 J, 6) 

4/5-B 


2 


4/5-B 

— 

2 


1/2-C 

4 

8 


1/2-C 

3 

6 

(20, 12, 8) 

3/5-B 

- 

3 


4/5-B 

— 

2 


1/2-C 

5 

9 


1/2-C 

3 

6 

(20, 11, 9) 

3/5-B 

- 

3 


4/5-B 

— 

2 


1/3-C 

3 

9 


1/2-C 

3 

6 

(20,10|,9) 

3/5-B 

- 

3 


4/5-B 

— 

2 


1/3-C 

4 

12 


1/2-C 

3 

6 

(20, 9|, 12) 

1/2-C 

3 

6 


3/5-B 

- 

3 


Note: In the table, the letter B denotes a block code over GF( 4) (an extended Reed-Solomon 
code), and C denotes a convolutional code over GF( 4) from [15]. 
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5 Constructions of Binary-to-g-ary Trellis Codes 


In many practical systems, the output of the information source is binary symbols, and the 
channel signals can be viewed as symbols in GF( 2 n ). Here are two examples: 

1. For multi-level codes based on multi-dimensional signal constellations [10, 16-18], sup- 
pose Hi and i/,+i are subsets of the signal set Ho and Hi D Hi+ 1. Then the cosets of 
Hi/Hi+i are usually isomorphic to GF(q ), where q is the number of cosets in 

So the outputs of the component encoder Ei can be viewed as elements in GF(q), but 
the inputs of the encoder are binary symbols. 

2. Piret [19] suggested a class of convolutional codes called word error-correcting codes. 
His word-error-correcting codes use word weight instead of Hamming weight as the dis- 
tance measure. Consider an (n, k ) convolutional code with k input bits and n output 
bits at each time interval. The n output bits are called a word, and if they are not 
all zeros, the word weight is 1. The word distance between any two code sequences 
is the word weight of the difference between these two code sequences. The minimum 
free word distance of a binary convolutional code is defined to be the minimum word 
distance between all pairs of distinct code sequences. Alternately, if we view a word 
as a symbol over GF( 2"), an ( n,k ) convolutional code is actually a £-input, 1-output 
binary-to-2 n -ary convolutional code, i.e., it is a special class of binary-to-^-ary convo- 
lutional codes. 

Ryan and Wilson [15] presented some optimal low rate binary-to-g-ary convolutional 
codes for q = 4,8, and 16. In their paper, the rate was defined as the number of input bits 
divided by the number of output symbols. We call this the binary-to-^-ary rate, denoted by 
Rb >q . If the binary-to-g-ary rate of a k input bit, n output symbol convolutional code is k/n , 
then the normalized rate is defined as R = fc/(nlog 2 <7). We will use the normalized rate to 
compare with the codes of the previous section. 
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The encoder structure of binary- to-^-ary trellis codes is still as shown in Figure 1. Many 
kinds of codes, including binary-to-g-ary codes, word-error-correcting codes, and codes over 
GF(q) can be used as component codes. (Codes over GF(q) can be chosen as component 
codes because any code over GF(q) can be viewed as a binary-to-q-ary code as long as each 
input symbol is viewed as log 2 q bits. In this sense, trellis codes over GF(q) are a special 
case of binary-to-q-ary trellis codes.) Because the free distance of a code depends only on 
the structure of the code sequences rather than on the input sequences, the inequality of 
(25) also holds for binary-to-g-ary trellis codes. 

As in the previous section, we list codes corresponding to the block length 20 codes listed 
in Table 1. Note that here the number of state is 2 Ki_1 rather than 4 K '~ l . Comparing Table 
4 to Table 5, one finds that the codes listed in Table 5 have less decoding complexity and 
a lower information rate. Therefore the codes in Table 5 offer additional trade-offs between 
information rate and decoding complexity. 

Note that the two (20,13^,6) codes have the same parameters in both tables, but the 
Table 5 code is better because the binary-to-4-ary convolutional code Co has a better distance 
distribution (fewer nearest neighbors) than the convolutional code over GF( 4). 
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Table 5. Binary-to-4-ary Trellis Codes (6 < d < 12) 


Trellis codes 

Ri 

Ki 

d f (i) 


1/2-C 

6 

6 


3/5-B 

- 

3 

(20,131,6) 

4/5-B 

- 

2 


4/5-B 

— 

2 


1/4-C 

4 

8 


1/2-C 

4 

4 

(20, lOf, 8) 

3/5-B 

- 

3 


4/5-B 

— 

2 


1/4-C 

5 

10 


1/2-C 

5 

5 

(20, 9±, 10) 

1/2-C 

4 

4 


3/5-B 


3 


1/4-C 

6 

12 


1/2-C 

6 

6 

(20, 91, 12) 

1/2-C 

4 

4 


3/5-B 

- 

3 


Note: In the table, the letter B denotes a block code over GF( 4) (an extended Reed-Solomon 
code), and C denotes a binary-to-4-ary convolutional code from [15]. 

6 Fast Coding and Decoding 

Coding and decoding schemes for multi-level codes based on two way partition chains were 
first presented by Imai and Hirakawa [1]. These were later generalized by Pottie and Taylor 
[5], Tanner [4], and Wu [11]. For simplicity, we take the construction A block codes as an 
example to illustrate the principles of encoding and decoding, which also apply to the other 
codes discussed above. 

From the structure of the encoder shown in Figure 1, the encoder consists of m component 
encoders and a mapper. Since known encoders can be used as component encoders, the 
major problem of encoding is to decrease the complexity of the mapping. From (16), (17), 
(2), and (3), the mapping can be implemented by computing Pi(x) in advance. Then the 
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coefficients of a codeword S'(x) can be obtained by taking the product of yj- and Pi(x ) for 
i = 0, .1, . . . , q — 2 and j = 0, 1, . . . , q — 2. Therefore, we must store the coefficients of Pi(x) 
in advance, for i = 1, 2, . . . , q — 2 (note that Po^) = 1)- Since the degree of P%{x) is i, only 
i symbols are required to store the coefficients of P,(x). The total number of symbols to be 
stored in the encoder is therefore q — 2 )(q — 1). From the following decoding procedure, 
we will see that the coefficients of Pi(x) should also be stored in the decoder. 

Assume that the receiver makes hard decisions and let S'(x) be a received codeword over 
GF(q). Then the decoding procedure is as follows: 

Step 1. For j = 0, 1, . . . , n 2 — 1, let £p\x) = 5 J (x) and set yj°* = 5^(1). 

Then (yo°\ 2?{°\ • • • , * s the decoder input for code Co, and the output is denoted by 

Step 2. (2 < i < q — 1) For j = 0, 1, . . . , — 1, let 


5. 0) (x) = 5 H(x) - P,_ 2 (x)y} 


(<- 2 ) 


(26) 


and set 


yf~ l) = S^id'- 1 ). (27) 

,y^* 2 “P) is the decoder input for code i, and the output is denoted 


Then (yj‘ 1} ,y!‘ a) , 

by 

[VO iVl > ■ • • i Vn-2^1 ) • 

Finally, the estimated information sequences I%(x) are obtained from the decoder esti- 
mates 

(vo \ y{'\ • • • , y$) , for i = 0, 1 , 2, . . . , q — 2, by applying the inverse of the encoder mapping. 

As shown by Tanner [4], the above decoding procedure can achieve the lower bound 
distance d of (13). If the design distance d < q — 1 in the above decoding procedure, only 
the first d — 1 steps are needed, and for the remaining q — d steps, 


(y ( o = ($0 ^yi' 1) i---»y&_V) » for i = d- l,d- 2 ,...,< 7- 1. (28) 
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Also, if d > q — 1 , the first [ steps can be omitted. 

From the decoding procedure, we can see that the decoding complexity is the sum of 
the complexity of computing (26) and (27) and the decoding complexity of each component 
code. The complexity of computing (26) and (27) is relatively small since the coefficients of 
the polynomials Pi(x ) are computed in advance, and in most cases the decoding complexity 
is dominated by the component code whose decoding complexity is the highest among all 
component codes. 

7 Conclusions 

We have applied set partitioning to multi-dimensional signal spaces over GF(q) to construct 
powerful <7- ary block and trellis codes. Many of these codes have a better trade-off of min- 
imum distance, information rate, and decoding complexity than previously known <7- ary 
codes. A fast decoding algorithm based on hard decision multi-stage decoding has been 
presented. (A decoding algorithm based on soft decisions appears to be quite complex at 
this time.) 

Although only Reed-Solomon codes, shortened extended Reed-Solomon codes, and ex- 
tended Reed-Solomon codes are used as component codes in the block code constructions, 
other 9-ary codes, such as 9-ary BCH codes, can also be used as component codes, possibly 
resulting in longer and better codes. 

The trellis codes constructed have better performance, but more decoding complexity, 
than the block codes. A comparison of performance vs. decoding complexity between these 
new block and trellis codes will be an interesting subject for further study. 
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Appendix C 

A Hybrid M- Algorithm/Sequential Decoder 
for Convolutional and Trellis Codes 



